Military Speech Communications over Vocoders in Tandem
نویسندگان
چکیده
Speech intelligibility of two types of vocoders was measured using the modified rhyme test. One type of vocoder, a continuous variable slope delta (CVSD), was a waveform encoder. The other type, an advanced multi-band excitation (AMBE), was a parametric encoder. In the first experiment, clear speech was processed through the vocoders. Intelligibility was measured in a control condition, i.e. without vocoding, with each type alone and with two vocoders in tandem. AMBE and CVSD performed similarly, 92.6 and 90.4%, respectively. CVSD-to-AMBE had little effect on intelligibility, measured at 89.2%. However, AMBEto-CVSD had a large degrading effect on intelligibility. The AMBE-to-CVSD direction scored about 81.7% intelligibility with clear, unaltered speech signals. The asymmetry between waveform-to-parametric and parametric-to-waveform encoders underscores the non-linear nature of tandem vocoders on intelligibility. When vocoders of the same type were in tandem, there was no additional effect on intelligibility. The double CVSD condition yielded 92.2% intelligibility and the double AMBE condition yielded 91%. The deleterious effects of speech clipping were measured in a second experiment, as these are ubiquitous in military radio transmission systems. The AMBE parametric vocoder performed at the 88% level in isolation and at 84% when tandemed with the CVSD waveform vocoder. Alternative methods of encoding speech signals are being explored to improve speech intelligibility performance in military communication systems.
منابع مشابه
Adaptive Noise Reduction in Aircraft Communication Systems
In many military environments, such as fighter jet cockpits, the increasing use of digital communication systems has created a need for robust vocoders and speech recognition systems. However, the high level of ambient noise in such environments makes vocoders less intelligible and makes reliable speech recognition more difficult. One method of enhancing the noise-corrupted speech is adaptive n...
متن کاملLow-bit-rate Speech Coding
Low-bit-rate speech coding, at rates below 4 kb/s, is needed for both communication and voice storage applications. At such low rates, full encoding of the speech waveform is not possible; therefore, low-rate coders rely instead on parametric models to represent only the most perceptually-relevant aspects of speech. While there are a number of different approaches for this modeling, all can be ...
متن کاملGeneral outline of HF digital radiotelephone systems
considering a) that voice communications in the HF band use 3 kHz channels; b) that security is essential for some communications; c) that scrambling is the only means of obtaining a sufficient level of security; d) that the required level of security can easily be achieved using digitized speech technology; e) that there is therefore a need for speech signal coders (vocoders) associated with H...
متن کاملDVSI Application of Vocoders
The need for increased utilization of available wireless communication spectrum has fueled the development of voice coding technology. From simple waveform coding techniques operating at 64 kbps, the advance of speech coding algorithms has produced communication quality systems at 2 kbps and below. This allows up to 32 communications channels to operate in the bandwidth formerly occupied by one...
متن کاملIncorporating Envelope Information for Low Bit Rate Vocoders
This paper presents an approach of incorporating speech envelope in low bit rate speech compression algorithm. The speech envelope is extracted by a specially designed peak-picking and interpolation scheme and quantified with 16 models that were acquired by a cluster analysis method. Experiments showed an improvement through traditional vocoders.
متن کامل